Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Artigo em Inglês | MEDLINE | ID: mdl-34547062

RESUMO

The most popular model for the search of ChIP-seq data for transcription factor binding sites (TFBS) is the positional weight matrix (PWM). However, this model does not take into account dependencies between nucleotide occurrences in different site positions. Currently, two recently proposed models, BaMM and InMoDe, can do as much. However, application of these models was usually limited only to comparing their recognition accuracies with that of PWMs, while none of the analyses of the co-prediction and relative positioning of hits of different models in peaks has yet been performed. To close this gap, we propose the pipeline called MultiDeNA. This pipeline includes stages of model training, assessing their recognition accuracy, scanning ChIP-seq peaks and their classification based on scan results. We applied our pipeline to 22 ChIP-seq datasets of TF FOXA2 and considered PWM, dinucleotide PWM (diPWM), BaMM and InMoDe models. The combination of these four models allowed a significant increase in the fraction of recognized peaks compared to that for the sole PWM model: the increase was 26.3 %. The BaMM model provided the main contribution to the recognition of sites. Although the major fraction of predicted peaks contained TFBS of different models with coincided positions, the medians of the fraction of peaks containing the predictions of sole models were 1.08, 0.49, 4.15 and 1.73 % for PWM, diPWM, BaMM and InMoDe, respectively. Thus, FOXA2 BSs were not fully described by only a sole model, which indicates theirs heterogeneity. We assume that the BaMM model is the most successful in describing the structure of the FOXA2 BS in ChIP-seq datasets under study.

2.
Dokl Biochem Biophys ; 485(1): 95-100, 2019 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-31201623

RESUMO

This is the first study to investigate the molecular-genetic organization of polytene chromosome interbands located on both molecular and cytological maps of Drosophila genome. The majority of the studied interbands contained one gene with a single transcription initiation site; the remaining interbands contained one gene with several alternative promoters, two or more unidirectional genes, and "head-to-head" arranged genes. In addition, intricately arranged interbands containing three or more genes in both unidirectional and bidirectional orientation were found. Insulator proteins, ORC, P-insertions, DNase I hypersensitive sites, and other open chromatin structures were situated in the promoter region of the genes located in the interbands. This area is critical for the formation of the interband, an open chromatin region in which gene transcription and replication are combined.


Assuntos
Elementos de DNA Transponíveis , Genes Essenciais , Cromossomos Politênicos/genética , Regiões Promotoras Genéticas , Animais , Drosophila melanogaster
3.
Dokl Biochem Biophys ; 483(1): 297-301, 2018 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-30607724

RESUMO

New data on the organization of genes entirely located in the open domains for chromatin transcription and occupying only one chromosome structure (interband) were obtained. The characteristic features of these genes are the small size (on average, 1-2 kb), depletion of the replicative complex proteins in the regulatory region, and the presence of specific motifs for binding transcription factors, as compared to the genes occupying two structures (interband and gray band). The biological function of these genes is associated primarily with the processes of gene expression and RNA metabolism.


Assuntos
Cromatina , Proteínas de Ligação a DNA , Proteínas de Drosophila , Regulação da Expressão Gênica/fisiologia , Cromossomos Politênicos , RNA , Animais , Cromatina/genética , Cromatina/metabolismo , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Proteínas de Drosophila/genética , Proteínas de Drosophila/metabolismo , Drosophila melanogaster , Cromossomos Politênicos/genética , Cromossomos Politênicos/metabolismo , RNA/biossíntese , RNA/genética
4.
Sci Rep ; 7(1): 2489, 2017 05 30.
Artigo em Inglês | MEDLINE | ID: mdl-28559568

RESUMO

Auxin plays a pivotal role in virtually every aspect of plant morphogenesis. It simultaneously orchestrates a diverse variety of processes such as cell wall biogenesis, transition through the cell cycle, or metabolism of a wide range of chemical substances. The coordination principles for such a complex orchestration are poorly understood at the systems level. Here, we perform an RNA-seq experiment to study the transcriptional response to auxin treatment  within gene groups of different biological processes, molecular functions, or cell components in a quantitative fold-change-specific manner. We find for Arabidopsis thaliana roots treated with auxin for 6 h that (i) there are functional groups within which genes respond to auxin with a surprisingly similar fold changes and that (ii) these fold changes vary from one group to another. These findings make it tempting to conjecture the existence of some transcriptional logic orchestrating the coordinated expression of genes within functional groups in a fold-change-specific manner. To obtain some initial insight about this coordinated expression, we performed a motif enrichment analysis and found cis-regulatory elements TBX1-3, SBX, REG, and TCP/site2 as the candidates conferring fold-change-specific responses to auxin in Arabidopsis thaliana.


Assuntos
Arabidopsis/genética , Ácidos Indolacéticos/metabolismo , Raízes de Plantas/genética , Arabidopsis/efeitos dos fármacos , Arabidopsis/metabolismo , Proteínas de Arabidopsis/química , Proteínas de Arabidopsis/genética , Regulação da Expressão Gênica de Plantas/genética , Ácidos Indolacéticos/farmacologia , Raízes de Plantas/crescimento & desenvolvimento , Raízes de Plantas/metabolismo , Dobramento de Proteína/efeitos dos fármacos , Transdução de Sinais/efeitos dos fármacos , Transdução de Sinais/genética
5.
Comput Biol Chem ; 64: 19-32, 2016 10.
Artigo em Inglês | MEDLINE | ID: mdl-27235721

RESUMO

Steroidogenic factor 1 (SF-1) belongs to a small group of the transcription factors that bind DNA only as a monomer. Three different approaches-Sitecon, SiteGA, and oPWM-constructed using the same training sample of experimentally confirmed SF-1 binding sites have been used to recognize these sites. The appropriate prediction thresholds for recognition models have been selected. Namely, the thresholds concordant by false positive or negative rates for various methods were used to optimize the discrimination of steroidogenic gene promoters from the datasets of non-specific promoters. After experimental verification, the models were used to analyze the ChIP-seq data for SF-1. It has been shown that the sets of sites recognized by different models overlap only partially and that an integration of these models allows for identification of SF-1 sites in up to 80% of the ChIP-seq loci. The structures of the sites detected using the three recognition models in the ChIP-seq peaks falling within the [-5000, +5000] region relative to the transcription start sites (TSS) extracted from the FANTOM5 project have been analyzed. The MATLIGN classified the frequency matrices for the sites predicted by oPWM, Sitecon, and SiteGA into two groups. The first group is described by oPWM/Sitecon and the second, by SiteGA. Gene ontology (GO) analysis has been used to clarify the differences between the sets of genes carrying different variants of SF-1 binding sites. Although this analysis in general revealed a considerable overlap in GO terms for the genes carrying the binding sites predicted by oPWM, Sitecon, or SiteGA, only the last method elicited notable trend to terms related to negative regulation and apoptosis. The results suggest that the SF-1 binding sites are different in both their structure and the functional annotation of the set of target genes correspond to the predictions by oPWM+Sitecon and SiteGA. Further application of Homer software for de novo identification of enriched motifs in ChIP-Seq data for SF-1ChIP-seq dataset gave the data similar to oPWM+Sitecon.


Assuntos
Fator Esteroidogênico 1/metabolismo , Animais , Sítios de Ligação , Imunoprecipitação da Cromatina , Ensaio de Desvio de Mobilidade Eletroforética , Masculino , Ratos , Ratos Wistar , Fator Esteroidogênico 1/química
6.
BMC Genomics ; 17: 337, 2016 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-27146967

RESUMO

BACKGROUND: A prominent and distinctive feature of the rye (Secale cereale) chromosomes is the presence of massive blocks of subtelomeric heterochromatin, the size of which is correlated with the copy number of tandem arrays. The rapidity with which these regions have formed over the period of speciation remains unexplained. RESULTS: Using a BAC library created from the short arm telosome of rye chromosome 1R we uncovered numerous arrays of the pSc200 and pSc250 tandem repeat families which are concentrated in subtelomeric heterochromatin and identified the adjacent DNA sequences. The arrays show significant heterogeneity in monomer organization. 454 reads were used to gain a representation of the expansion of these tandem repeats across the whole rye genome. The presence of multiple, relatively short monomer arrays, coupled with the mainly star-like topology of the monomer phylogenetic trees, was taken as indicative of a rapid expansion of the pSc200 and pSc250 arrays. The evolution of subtelomeric heterochromatin appears to have included a significant contribution of illegitimate recombination. The composition of transposable elements (TEs) within the regions flanking the pSc200 and pSc250 arrays differed markedly from that in the genome a whole. Solo-LTRs were strongly enriched, suggestive of a history of active ectopic exchange. Several DNA motifs were over-represented within the LTR sequences. CONCLUSION: The large blocks of subtelomeric heterochromatin have arisen from the combined activity of TEs and the expansion of the tandem repeats. The expansion was likely based on a highly complex network of recombination mechanisms.


Assuntos
Elementos de DNA Transponíveis , Amplificação de Genes , Heterocromatina/genética , Secale/genética , Sequências de Repetição em Tandem , Cromossomos Artificiais Bacterianos , Cromossomos de Plantas/genética , Biblioteca Gênica , Componentes Genômicos , Hibridização in Situ Fluorescente , Análise de Sequência com Séries de Oligonucleotídeos , Filogenia , Análise de Sequência de DNA
7.
J Biomol Struct Dyn ; 31(1): 96-102, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-22803765

RESUMO

Elongation efficiency index (EEI) was suggested earlier to estimate gene expression efficiency by nucleotide context of coding sequence in unicellular organisms. We have analyzed association between EEI and nucleosome formation potential (NFP) in 5' regulatory regions upstream translation initiation site (TIS) from two yeast species. Theoretical estimations of NFP based on DNA sequence were obtained by Recon method. Experimental estimation of nucleosome occupancy was obtained by high-throughput sequencing data of nucleosomal DNA in Saccharomyces cerevisiae . For the sample of all genes correlation coefficient was calculated between two vectors: vector of NFP values for fixed position relative to TIS and vector of EEI values. Profiles of correlation coefficients of NFP and EEI were counted in (-600; +600) regions relative to TIS for gene sequences extracted from GenBank. We found regions of strong negative dependence between NFP and EEI for all genes as well as for 10% highly expressed genes in Schizosaccharomyces pombe (10% of EEI-highest genes). At the same time, we found positive dependence between NFP and EEI for all genes and for low expressed genes in S. cerevisiae (10% of EEI-lowest genes). The association between NFP and EEI could be explained by evolutionary selection of context characteristics of nucleotide sequences for gene expression optimization.


Assuntos
Nucleossomos/metabolismo , Iniciação Traducional da Cadeia Peptídica , Regiões Promotoras Genéticas , Saccharomyces cerevisiae/genética , Schizosaccharomyces/genética , DNA Fúngico/química , DNA Fúngico/metabolismo , Genes Fúngicos , Sequências Reguladoras de Ácido Nucleico , Saccharomyces cerevisiae/metabolismo , Schizosaccharomyces/metabolismo
9.
Biochemistry (Mosc) ; 72(11): 1187-93, 2007 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-18205600

RESUMO

The development of computer-assisted methods for transcription factor binding sites (TFBS) recognition is necessary for study the DNA regulatory transcription code. There are a great number of experimental methods that enable TFBS identification in genome sequences. The experimental data can be used to elaborate multiple computer approaches to recognition of TFBS, each of which has its own advantages and limitations. A short review of the characteristics of computer methods of TFBS prediction based on various principles is presented. Methods used for experimental monitoring of predicted sites are analyzed. Data concerning DNA regulatory potential and its realization at the chromatin level, obtained using these methods, are discussed along with approaches to recognition of target genes of certain transcription factors in the genome sequences.


Assuntos
Biologia Computacional , Fatores de Transcrição/metabolismo , Vertebrados/genética , Vertebrados/metabolismo , Animais , Sítios de Ligação/genética , Simulação por Computador , DNA/genética , DNA/metabolismo , Genoma , Humanos
10.
Bioinformatics ; 17(11): 998-1010, 2001 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-11724728

RESUMO

MOTIVATION: A rapid growth in the number of genes with known sequences calls for developing automated tools for their classification and analysis. It became clear that nucleosome packaging of eukaryotic DNA is very important for gene functioning. Automated computer tools for characterization of nucleosome packaging density could be useful for studying of gene regulation and genome annotation. RESULTS: A program for constructing nucleosome formation potential profiles of eukaryotic DNA sequences was developed. Nucleosome packaging density was analyzed for different functional types of human promoters. It was found that in promoters of tissue-specific genes, the nucleosome formation potential was essentially higher than in genes expressed in many tissues, or housekeeping genes. Hence, capability of nucleosome positioning in the promoter region may serve as a factor regulating gene expression. AVAILABILITY: The program for nucleosome sites recognition is included into the GeneExpress system; section 'DNA Nucleosomal Organization', http://wwwmgs.bionet.nsc.ru/mgs/programs/recon/.


Assuntos
DNA/genética , Nucleossomos/genética , Software , Algoritmos , Composição de Bases , Biologia Computacional , DNA/química , Células Eucarióticas , Expressão Gênica , Histonas/química , Histonas/genética , Regiões Promotoras Genéticas
11.
Bioinformatics ; 17(11): 1062-4, 2001 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-11724736

RESUMO

A program for constructing nucleosome formation potential profile was applied for investigation of exons, introns, and repetitive sequences. The program is available at http://wwwmgs.bionet.nsc.ru/mgs/programs/recon/. We have demonstrated that introns and repetitive sequences exhibit higher nucleosome formation potentials than exons. This fact may be explained by functional saturation of exons with genetic code, hindering the localization of efficient nucleosome positioning sites.


Assuntos
Elementos Alu , Éxons , Íntrons , Nucleossomos/genética , Software , Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Humanos
12.
Bioinformatics ; 15(7-8): 582-92, 1999.
Artigo em Inglês | MEDLINE | ID: mdl-10487866

RESUMO

MOTIVATION: Chromatin structure plays the crucial role in proper gene functioning. Therefore, it is very important to investigate nucleosomal DNA properties and recognize genome nucleosome positioning sequences. Nevertheless, applying different sequence analysis methods separately is insufficient for complete nucleosomal DNA description. One of the most probable reasons for that is the weakness of nucleosome positioning signals. The present paper offers a set of methods to reveal the most important nucleosomal DNA characteristics and to show a common pattern of nucleosome site properties. RESULTS: A complex approach was used to determine conformational and physicochemical properties that are most significant for nucleosome binding site description. The integrated database of nucleosomal DNA properties is compiled. This database comprises different sections for description of DNA characteristics. Revealing significant DNA characteristics allows the classification of various samples of site sequences and the generation of programs for site recognition. AVAILABILITY: The current version of the database is available at http://wwwmgs.bionet.nsc. ru/system/BDNAvideo/. C-code of the recognition program may be found in the section FEATURE. WWW-available programs for testing arbitrary sequences are accessible at http://wwwmgs.bionet.nsc. ru/Programs/bDNA/NA_bDNA.htm/. The links to the mirror site(s) can be found at http://wwwmgs.bionet.nsc.ru/mgs/links/mirrors.html+ ++.


Assuntos
DNA/química , DNA/genética , Bases de Dados Factuais , Nucleossomos/genética , Algoritmos , Animais , Sítios de Ligação/genética , Fenômenos Químicos , Físico-Química , DNA/metabolismo , Humanos , Nucleossomos/metabolismo , Software
13.
Bioinformatics ; 15(7-8): 644-53, 1999.
Artigo em Inglês | MEDLINE | ID: mdl-10487872

RESUMO

MOTIVATION: Despite the growing volume of data on primary nucleotide sequences, the regulatory regions remain a major puzzle with regard to their function. Numerous recognising programs considering a diversity of properties of regulatory regions have been developed. The system proposed here allows the specific contextual, conformational and physico-chemical properties to be revealed based on analysis of extended DNA regions. RESULTS: The Internet-accessible computer system RegScan, designed to analyse the extended regulatory regions of eukaryotic genes, has been developed. The computer system comprises the following software: (i) programs for classification dividing a set of promoters into TATA-containing and TATA-less promoters and promoters with and without CpG islands; (ii) programs for constructing (a) nucleotide frequency profiles, (b) sequence complexity profiles and (c) profiles of conformational and physico-chemical properties; (iii) the program for constructing the sets of degenerate oligonucleotide motifs of a specified length; and (iv) the program searching for and visualising repeats in nucleotide sequences. The system has allowed us to demonstrate the following characteristic patterns of vertebrate promoter regions: the TATA box region is flanked by regions with an increased G+C content and increased bending stiffness, the TATA box content is asymmetric and promoter regions are saturated with both direct and inverted repeats. AVAILABILITY: The computer system RegScan is available via the Internet at http://www.mgs.bionet.nsc. ru/Systems/RegScan, http://www.cbil.upenn.edu/mgs/systems/r egscan/.


Assuntos
Sistemas Computacionais , DNA/genética , Genes Reguladores , Algoritmos , Animais , Sequência de Bases , Ilhas de CpG , Bases de Dados Factuais , Internet , Regiões Promotoras Genéticas , Software , TATA Box
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...